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Abstract 

Background: NaTrxh, a thioredoxin type h, shows differential expression between self-incompatible and 
self-compatible Nicotiana species. NaTrxh interacts In vitro with S-RNase and co-localizes with it in the extracellular 
matrix of the stylar transmitting tissue. NaTrxh contains N- and C-terminal extensions, a feature shared by thioredoxin 
h proteins of subgroup 2. To ascertain the function of these extensions in NaTrxh secretion and protein-protein 
interaction, we performed a deletion analysis on NaTrxh and fused the resulting variants to GFP. 

Results: We found an internal domain in the N-terminal extension, called N(3, that is essential for NaTrxh secretion 
but is not hydrophobic, a canonical feature of a signal peptide. The lack of hydrophobicity as well as the location 
of the secretion signal within the NaTrxh primary structure, suggest an unorthodox secretion route for NaTrxh. 
Notably, we found that the fusion protein NaTrxh-GFP(KDEL) is retained in the endoplasmic reticulum and that 
treatment of NaTrxh-GFP-expressing cells with Brefeldin A leads to its retention in the Golgi, which indicates that 
NaTrxh uses, to some extent, the endoplasmic reticulum and Golgi apparatus for secretion. Furthermore, we found 
that N(3 contributes to NaTrxh tertiary structure stabilization and that the C-terminus functions in the protein-protein 
interaction with S-RNase. 

Conclusions: The extensions contained in NaTrxh sequence have specific functions on the protein. While the 
C-terminus directly participates in protein-protein interaction, particularly on its interaction with S-RNase in vitro; the 
N-terminal extension contains two structurally different motifs: Na and N(3. N(3, the inner domain (Ala-17 to Pro-27), is 
essential and enough to target NaTrxh towards the apoplast. Interestingly, when it was fused to GFP, this protein 
was also found in the cell wall of the onion cells. Although the biochemical features of the N-terminus suggested 
a non-classical secretion pathway, our results provided evidence that NaTrxh at least uses the endoplasmic reticulum, 
Golgi apparatus and also vesicles for secretion. Therefore, the N(3 domain sequence is suggested to be a novel signal 
peptide. 
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Background 

Thioredoxins (Trxs) are widely distributed in nature 
from prokaryotes to eukaryotes. These proteins, which 
belong to the oxidoreductase thiol:disulfide superfamily 
[1], are characterized by the active site signature sequence 
WCXXC. This sequence motif constitutes the redox 
center mediating the isomerization of specific disulfide 
bridges on Trx target proteins [2]. In yeasts and mam- 
mals, the cytoplasmic Trx redox system is complemen- 
ted by a second Trx system within mitochondria. In 
plants, the system is more intricate due to the presence 
of chloroplastic Trxs that are strongly associated with the 
regulation of chloroplast metabolism and function [3]. In 
mammals and yeast, only two and three Trx-encoding 
genes, respectively, have been identified so far. In contrast, 
about 19 genes encoding Trxs are contained in Arabi- 
dopsis thaliana genome, recently reviewed in [4,5]. 

Trxs were initially described as reductants of ribonu- 
cleotide reductase during DNA synthesis [6,7]. Later, 
these proteins were shown to take part in a variety of 
important physiological processes, for example as electron 
donors for several biosynthetic oxidoreductases [8-10] or 
as protectants against oxidative damage by reduction of 
the disulphide bridges within many proteins. Interestingly, 
Trxs and Trx-related proteins are being found to be 
involved in several sexual plant reproduction processes 
as well, as reviewed in [11]. The functional diversity of 
Trxs correlates with their wide distribution in nature 
and with the large variability in their primary struc- 
tures (from 27% - 69% of identity among the amino 
acid sequences) [12]. Their features and functions have 
been recently reviewed [13,14]. 

Plant Trxs can be divided into eight types based on 
their sequence [15]. Types f, m, x, y, and z are localized 
in chloroplasts, type o is found in mitochondria, and 
type s is associated with the endoplasmic reticulum (ER) 
[2,15-19]. Information about the subcellular localization 
of type h (Trxs h), the largest group of this protein family, 
is limited since this group includes proteins located in 
the cytosol as well as in mitochondria and even secreted 
to the apoplast [20-22]. 

Plant Trxs are also involved in highly specialized bio- 
logical processes, including self-incompatibility (SI) in 
Brassica [23]. Two Trxs h proteins, THL1 and THL2, 
interact with the C-terminal domain of the S-locus 
receptor kinase (SRK), which is the female determinant in 
the sporophytic SI system in Brassica [24]. The formation 
of the SRK-THL complex occurs during self-compatible 
pollinations and it has been proposed that it prevents the 
SRK dimerization and self-phosphorylation; the last event 
is essential to the activation of the pollen rejection 
response [23]. Moreover, suppression of THL1 and 
THL2 in transgenic plants has shown that both Trxs are 
required for full pollen acceptance [25]. Trxs h also may 



play a role in the gametophytic S-RNase-based SI sys- 
tem in Nicotiana alata since NaTrxh reduces in vitro to 
the S-RNase, the female 5-determinant [22]. Moreover, 
the NaTrxh transcript is more abundant in SI species 
than in self-compatible ones from Nicotiana spp. [26]. 
In general, evidence indicating the involvement of Trxs 
and, in general, thiol/disulphide containing proteins within 
plant sexual reproduction processes is increasing, meaning 
that redox regulation plays a pivotal role in regulating 
these signalling mechanisms [11]. 

Trx h group is subdivided into three subgroups [27]. 
Subgroup 2 includes Trxs with an N-terminal extension. 
Some evidence suggests a role for this extension in Trx 
intracellular trafficking. In Populus tremula, the N-ter 
minus of PtTrx/z2 functions as a mitochondrial target- 
ing signal [21]. As with other subgroup 2 members, 
N. alata NaTrxh contains extensions toward its C- and 
N-termini, but their functions have not been investi- 
gated. Notably, NaTrxh does not possess a canonical 
signal peptide at its N-terminus but is secreted onto the 
extracellular matrix of the style [22]. Therefore, either or 
both the N- or C-terminus could be involved in NaTrxh 
secretion and/or mediate the protein-protein interaction 
of NaTrxh with its target proteins. 

Here, we show that NaTrxh secretion depends on an 
inner segment within its N-terminal extension. This seg- 
ment, Np, guides secretion of NaTrxh through the ER 
and Golgi. In addition, pull-down assays indicate that 
the C-terminal extension participates in the interaction 
with S-RNase. Likewise, in silico structure modeling 
predicts both the N- and C-terminal extensions to be 
solvent exposed and to fold into stable secondary structure 
elements. The model is consistent with an active role of 
both extensions in tertiary structure stabilization, with 
little or no effect on NaTrxh reductase activity. 

Results 

NaTrxh localizes to the extracellular matrix of the 
transmitting tissue in N. alata styles or associates with 
secretory pathway elements 

Previously, we demonstrated that NaTrxh co-localizes to 
the extracellular matrix (ECM) of the stylar transmitting 
tissue in N. alata along with the S-RNase [22]. Although 
it lacks a canonical signal peptide, NaTrxh contains 
sufficient information to guide its secretion, raising the 
possibility that this protein could follow a non-classical 
secretion pathway, as suggested by the Secretome 1.0 
algorithm [22]. Immuno-gold labelling and electron mi- 
croscopy data were consistent with an NaTrxh localization 
at the ECM of the same N. alata stylar tissue (Figure 1). 
Notably, a semi-quantitative analysis, counting all ob- 
served particles from five different micrographs at a 
12 K resolution, revealed gold particles to be associated 
with structures related to the secretory system (Figure 1A). 
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Figure 1 NaTrxh localized to the cell wall or associates to secretory elements in N. alata styles. (A) Semi-quantitative analysis of the 
localization of the gold particles (i.e., NaTrxh) by the electron microscopic immune-gold assays. Sub-panel shows NaTrxh was immunodetected in 
a stylar microsomal fraction (Mi) along with vATPase. E: crude protein extract. (B) NaTrxh was associated with vesicles (V), the trans-Golgi network 
(TGN), or in the cell wall (CW). (C - D) NaTrxh was mainly found associated to membranous systems, such as the endoplasmic reticulum (ER), the 
Golgi apparatus (G), or within vesicles. (E - F) Vesicles containing gold particles. In (f), a vesicle is observed fused to the plasma membrane (M). 
NaTrxh localization (arrows). Scale bars are shown in each micrograph. (B - F) Ultra thin sections of N. alata styles were treated with anti-NaTrxh 
and then with anti-rabbit coupled to gold particles. 



This association is consistent with the immune detection 
of both NaTrxh and the vATPase (marker) in the micro- 
somal fraction of a protein crude extract from N. alata 
styles (Figure 1A, sub-panel). In Figure IB, D, E, and F, 
gold particles (i.e., NaTrxh) are observed in association 
with vesicles, some of which reach the plasma membrane. 
These images are suggestive of membrane fusion leading 
to the extracellular release of the vesicle content, including 
NaTrxh (Figure IF), which also was found at the ECM, 
labelled as cell wall (CW; Figure IB, C, E, F). Figure 1C 
and D are representative micrographs where NaTrxh was 
found in association either with the ER or the Golgi. In 
contrast to the Secretome 1.0 algorithm prediction [22], 
our data show at least a fraction of NaTrxh travelling 
through the ER and Golgi secretory pathway en route 
to its final apoplastic localization in the styles of 
N. alata. However, as previously mentioned, NaTrxh 



lacks a canonical signal peptide, and the localization 
found through immuno-gold and electron microscopy 
provides cellular confirmation of secretion. 

NaTrxh N- and C-terminal extensions 

As previously reported [22] and shown in Figure 1, 
NaTrxh is secreted in N. alata styles. Contrary to the 
Secretome 1.0 algorithm, which predicts a non-classical 
secretion signal for NaTrxh, the hidden Markov algorithm 
[28] predicts a cleavage site between residues Ala-16 and 
Ala-17, albeit with a low probability (p = 0.593) [22]. 
Multiple alignment of several Trxs h from subgroup 2 
showed that the NaTrxh N-terminal extension sequence 
is at least 27 residues long (Additional file 1: Figure SI) 
and its C-terminal extension comprises residues E-136 
to Q-152 (Additional file 1: Figure SI). 
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Based on the above predictions, we divided the N- 
terminus of NaTrxh in two motifs: Net (from Met-1 to 
Ala-16) and N|3 (Ala-17 to Pro-27). The C-terminal ex- 
tension was defined starting at E-136 (Figure 2A). 

The N(i region is crucial for NaTrxh secretion 

To test if either extension is responsible for NaTrxh 
secretion, we generated NaTrxh deletion mutants lacking 
different sequence segments, fused to green fluorescent 



protein (GFP), and then transiently expressed them in 
onion epidermal cells. 

First, we showed that the full-length NaTrxh fused to 
GFP is observable in the extracellular space of onion 
epidermal cells (Figure 3A and B), as reported in 
N. benthamiana and A. thaliana [22]. The same was 
observed for the stylar ECM protein pll [29] fused to 
GFP (Figure 3C and D). We observed the same pattern 
when the Na motif is deleted from the N-terminus of 
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Figure 2 The Np motif is responsible for NaTrxh secretion. (A) An NaTrxh scheme indicating its N- and C-terminal extensions. The N-terminus 
was subdivided into two regions: Na (red) and N(3 (cyan). The C-terminus (green). (B) Transient expression of the different NaTrxh mutants fused to 
GFP in onion epidermal cells. (B-1) (B-2) Full-length NaTrxh. (B-3) (B-4) NaTrxhANa. (B-5) (B-6) NaTrxhAN(3. (B-7) (B-8) NaTrxhANctp. (B-9) (B-10) N(3 
motif (Ala-1 7 to Pro-27) directly fused to GFP. (B-1 1) (B-12) NaTrxhACOO. (B-1) (B-3) (B-5) (B-7) (B-9) (B-1 1) GFP fluorescence. (B-2) (B-4) (B-6) (B-8) 

(B-10) (B12). Bright fields merged with flourescence images. The cells were plasmolyzed with 1 M NaCI before confocal observation. CW: cell wall; M. 
plasma membrane. Scale bars = 50 um. 
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Figure 3 NaTrxh: GFP is secreted in onion epidermal cells. (A - B) GFP fluorescence from the NaTrxh: GFP fusion protein, was localized on the 
cell wall (CW). (C — D) pi 1 is a known secreted protein in N. data styles that was also secreted. (E - F) GFP alone was not secreted when transiently 
expressed. (G - H) Non-transformed cells. (A, C, E, G) GFP fluorescence. (B, D, F, H) Bright fields merged with flourescence images. M: plasma 
membrane; CW: cell wall; GFP fluorescent signal (arrows). The cells were plasmolized with 1 .0 M NaCI before confocal observation. Scale bars = 30 urn. 



NaTrxh (NaTrxhANa: GFP; Figure 2B-3 and 2B-4) and, 
therefore, concluded the Net domain is not required 
for targeting NaTrxh to the apoplast. However, when 
NaTrxhANctp, which lacks both the Not and N|3 motifs, 
was expressed as a GFP fusion protein, fluorescence was 
localized inside the cells, indicating that secretion was 
abolished (Figure 2B-7 and B-8). When the C-terminus 
was deleted from NaTrxh (NaTrxhACOO: GFP), GFP 
fluorescence was localized to the apoplast (Figures 2B-11 
and B-12). These data show that the N-terminal exten- 
sion carries all the information for NaTrxh secretion. 
However, in contrast to an orthodox N-terminal signal 
peptide, the first 17 amino acids are not required, as the 
inner N(3 domain promotes secretion in the absence of 
the Net segment. To test this hypothesis, we generated an 
NaTrxh protein mutant with the Net domain adjacent to 
the Trx core, deleting the N(3 domain (NaTrxhAN|3), and 
then expressed it as a GFP fusion protein. Transient 
expression of NaTrxhANp: GFP is shown in Figures 2B-5 
and B-6. GFP fluorescence can be observed within the 
cytosol. Furthermore, fusion to GFP of the N(3 domain 
alone leads to extracellular localization of the GFP signal, 
which resembles the distribution found for full-length 
NaTrxh (Figure 2B-9 and B-10). Together, these outcomes 



provide strong evidence that the N(3 domain is both essen- 
tial and sufficient for NaTrxh secretion. 

NaTrxh uses the endomembrane system to reach 
the apoplast 

While clearly sufficient to function as a secretion signal, 
the Np domain may guide NaTrxh secretion through 
an unorthodox secretion pathway. This possibility is 
suggested by the N|3 domain's unusual position within 
the primary structure (17 residues from the N-terminus) 
and the absence of a long hydrophobic amino acid region 
(Additional file 2: Figure S2). To evaluate if NpMed secre- 
tion proceeds via the ER, we looked for the presence of 
NaTrxh in the ER using two NaTrxh fusion proteins, 
NaTrxh:G FP (KDEL) and N|3: GFP(KDEL), both of which 
exhibit the ER retention signal KDEL [30,31]. As a control, 
we also fused pll to GFP(KDEL). pll is a known secreted 
protein from N. alata [29] with a typical signal peptide 
that is expected to follow the classical ER/Golgi pathway. 
The GFP signal from all GFP(KDEL) fusion proteins 
exhibits a typical ER distribution pattern surrounding the 
nucleus. The reticulate fluorescent pattern observed with 
both fusion proteins (Figure 4A-1 and A-4) and, inter- 
estingly, with the N|3: GFP(KDEL) as well (Figure 4A-7), 
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Figure 4 NaTrxh uses the ER/Golgi secretion elements to reach the apoplast. (A) Transient expression in onion cells of different proteins 
with the ER retention signal (KDEL) toward the C-termini. (A-1) (A-4) (A-7) GFP fluorescence. (A-2) (A-5) (A-8) Nucleus staining with propidium 
iodide. (A-3) (A-6) (A-9) Merged images. Scale bars = 20 urn. (B) Transient expression of pi 1 :GFP, NaTrxh:GFP and N(3: GFP in onion cells treated 
with BFA (50 ug/ml). (B-1) (B-3) (B-5) GFP fluorescence. (B-2) (B-4) (B-6) Bright fields merged with fluorescence images. The observations were 
made after plasmolysis with 1 M NaCI. CW: cell wall; M. plasma membrane. Scale bars = 50 urn. 



contrasts with the blurred pattern of the nucleus (Figures 4A- 
2, A-5 and A-8). These data are consistent with the passage 
of NaTrxh through the ER on its way out of the cell 
(Figure 4A). 



Evidence for participation of the Golgi network in 
NaTrxh secretion was obtained from treatment of onion 
epidermal cells with the fungal toxin Brefeldin A (BFA). 
BFA blocks vesicle formation at the Golgi network, which 
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prevents secretion of NapllGFP, NaThx:GFP, and N|3: 
GFP (Figure 4B). Additional evidence that NaTrxh is se- 
creted through vesicles is NaTrxh association with a mem- 
brane fraction (Figure 1A). Taken together, these results 
show that the N|3 domain is a hydrophilic novel internal 
signal able to promote NaTrxh secretion via the ER/Golgi. 



length NaTrxh, the NaTrxh variants show no differences 
in their ability to reduce insulin disulfide bonds using dithio- 
threitol (DTT) as an electron donor (Figure 5B) [7]. This re- 
sult demonstrates that the N-terminal extension functions 
in NaTrxh trafficking and, like the C-terminus, does not par- 
ticipate in NaTrxh's ability to reduce target proteins. 



The N-terminal region of NaTrxh accounts for structural 
stability but not for its reductase activity 

To evaluate whether the N-terminal extension, the C- 
terminal extension, or both extensions participate in 
NaTrxh reductase activity, we overexpressed four NaTrxh 
mutants as GST fusion proteins in Escherichia coli. The 
NaTrxhANoc, NaTrxhANctp\ and NaTrxhACOO proteins 
were recovered from the soluble phase from bacterial 
sonicates (Figure 5A), as reported for the full NaTrxh 
[22]. Notably, NaTrxhANP is only detected at the insol- 
uble phase (Figure 5A), suggesting that the protein does 
not fold correctly; therefore, its activity as a disulphide 
reductase could not be tested. When compared to full- 



N. alata S-RNase interacts in vitro with NaTrxh by its 
C-terminal region 

We previously reported the in vitro interaction of NaTrxh 
with the pistil S-determinant S-RNase from N. alata. The 
interaction takes place regardless of the NaTrxh redox 
state [22]. To test whether the N-terminal or C-terminal 
region accounts for this specific protein-protein inter- 
action, we prepared GSTNaTrxh-, GST:NaTrxhANoc-, 
GST:NaTrxhANa(3-, and GSTNaTrxhACOO-Affi-Gel affin- 
ity columns and passed through them extracellular stylar 
protein extracts from N. alata S W sS W s- 

Figure 6A shows that the Sios-RNase was retained in the 
NaTrxh-GST-Affi-gel matrix, as reported by Juarez-Diaz 
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Figure 5 NpS domain contributes to NaTrxh stability. N- and 
C-termini are not involved in its reductase activity. (A) SDS-PAGE 
analysis of different NaTrxh versions fused to GST expressed in £ coli 
cells. Upper panels: Coomassie blue stained gel; lower panels: 
western-blot immuno-stained with anti-NaTrxh antibody. S: soluble 
fraction; I: insoluble fraction; GST: gluthathione S-transferase; NaT: 
full-length NaTrxh; NaTAa: NaTrxhANa mutant; NaTA|3: NaTrxhANP; 
NaTAaP: NaTrxhANaP; NaTACOO: NaTrxhACOO. Arrows indicate 
the signal corresponding to the different NaTrxh:GST versions. 
(B) Thioredoxin activity assay using insulin as substrate and DTT as 
electron donor (Holmgren, 1979). Circles: NaTrxh; Diamonds: NaTrxhANa; 
Squares: NaTrxhANaP; Triangles: NaTrxhACOO; Crosses: only DTT. 
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Figure 6 The NaTrxh C-terminus contributes to the 
NaTrxh: S-RNase interaction. Pull-down experiments were 
performed using columns with the different NaTrxh versions 
[(A) NaT: NaTrxh; (B) NaTAa: NaTrxhANa; (C) NaTAaP: NaTrxhANaP; 
(D) NaTACOO: NaTrxhACOO]. Dotted lines indicate deleted regions. 
Stylar proteins from S105S105 N. alata were passed through each 
column, and then, each recovered fraction was analysed by western 
blot immune-staining with anti-S, 05 -RNase. UB: unbound fraction; 
W1, W5, and W10: first, fifth, and tenth washes, respectively, with 
binding buffer; Tw: binding buffer plus 0.1% Tween-20; NaCI 0.1 
and 0.2: washes with 50 mMTris, pH 7.9+100 mM or 200 mM NaCI, 
respectively; B1 and B2: bound fractions eluted with one and two 
bed volumes of elution buffer. 
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et al. [22]. Notably, we observed a similar binding behaviour 
when crude style extracts from N. alata 51055105 were 
passed through the NaTrxhANa and NaTrxhANa|3 
matrices (Figure 6B and C). Noteworthy, when the pro- 
tein extracts are passed through the affinity column with 
NaTrxhACOO, the Si 0 s-RNase is not retained (Figure 6D). 
These data show that the NaTrxh C-terminus contributes 
to the interaction with the Sios-RNase. 

The IMP domain plays a structural role in NaTrxh 

NaTrxh is predicted to interact with other trafficking- 
related proteins to be secreted. Thus, the Np domain is 
likely to be exposed at the molecular surface to facilitate 
such interactions. To support this hypothesis, we con- 
structed a model of NaTrxh using a combination of hom- 
ology modeling and molecular dynamic (MD) simulations. 
We used Modeller 9v4 [32] for the homology modelling 
and GROMACS 3.3.1 [33,34] for the MD simulations. 

While the closest homologue of NaTrxh with a known 
3D-structure is the Hordeum vulgare H2 Trx (2IWT), the 
N. alata protein possesses extensions toward its N- and C- 
termini, which has no homologues in the Protein Data 
Bank (PDB) [35]. We obtained a predicted conformation 
for these extensions by performing two rounds of MD sim- 
ulations. The structure shown in Figure 7E is a representa- 
tive conformation, drawn with visual molecular dynamics 
(VMD) molecular viewer [36]; mobile regions are shown in 
orange, blue and green. At the end of the second run, the 
N- and C-termini folded to form a "beta sheet hat" sepa- 
rated from the Trx core and opposite the putative reactive 



site loop (with the motif xCxPCx). The beta sheet was 
fully formed after 20 ns and remained stable thereafter. 
Only four segments in the protein showed significant 
fluctuation in the final model: the first 5 and the last 5 
residues, the loop where the reactive cysteine residues 
reside (60 to 66), and a loop connecting the core to the 
N-side of the "beta sheet hat" (residues 23 to 26). 

The model was rated from very good to fairly good by 
Atomic Non-Local Environment Assessment (ANOLEA) 
[37] (Figure 7B) and ProQ [38]. With the Rd.HMM proto- 
col [39], we used the coordinates of the backbone atoms of 
the model (after replacement of sequence information with 
random amino acid sequences) to retrieve the N. alata 
NaTrxh amino acid sequence from the NCBI nr protein 
database [40] with a statistical significance substantially 
higher than the one for the H. vulgare sequence (homology 
modeling template protein). In contrast, the 21 WT crystal 
as well as some of the initial models from Modeller, when 
subjected to the Rd.HMM protocol, scored the sequence of 
the barley protein and several other Trx h proteins with 
high probability, while the N. alata amino acid sequence 
was recovered with an E value above 1 (lacking statistical 
significance). According to its quality and appropriateness 
scores (see Methods), the model appears to be reasonably 
close to the reduced form of the N. alata NaTrxh 3D struc- 
ture. The appropriateness score is worth noting because 
the Rd.HMM is known to be very sensitive, which may re- 
sult in false negatives (i.e., the model is rejected even when 
it may be an approximate description of the native-like 3D 
structure). However, no false positives have been found yet. 
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Interestingly, in all models produced, the N-terminus 
remains accessible to the solvent, especially the region 
corresponding to residues 20 to 28, which coincides with 
the Np domain. In addition, the final conformations of 
the N- and C-termini anchors and the N-terminal exten- 
sion to the hat (Figure 7E) forces the poorly ordered 
loop of amino acids from residues 23 to 26 to remain on 
the protein surface. Since this region seems to be suffi- 
cient for NaTrxh secretion, its anchorage may facilitate 
the recognition of this sequence by some unidentified 
component of the secretion pathway. To assess the 
potential of the NaTrxh extensions to interact with other 
proteins, we compared them to intrinsically disordered 
regions (IDRs). The amino acid sequences known as in- 
trinsically disordered proteins (IDPs) or IDRs, among 
other names, are proteins or partial regions of proteins 
that lack stable and well-defined 3D structures under 
physiological conditions in vitro [41-43]. We identified 
the IDRs using DisEMBL [28]; server at [44], which re- 
lies on three criteria to assign an amino acid sequence 
as disordered: loops/coils, hot loops, and remark465. 
The loops/coils definition identified residues 1 to 47, 59 
to 70, 75 to 86, and 135 to 152 as IDRs (Figure 7C); hot 
loops reported segments 19 to 26 and 118 to 152 as 
IDRs (Figure 7B); and remark465 defined the first 28 N- 
terminal residues as the only IDR in NaTrxh (Figure 7A). 
According to DisEMBL, the NaTrxh extensions are 
IDRs, and all three criteria agree with MD simulations in 
the prediction of the N[3 region (Figure 7D) as a poorly 
structured protein segment. 

Discussion 

Here, we demonstrated that the Nf> domain (A 17 EAESGSS- 
SEP27) is required for NaTrxh secretion. Additionally, 
we provided evidence on NaTrxh targeting to the apo- 
plast via the ER/Golgi regardless of the absence of a 
distinguishable hydrophobic signal peptide. Finally, we 
also present data that suggest that the C-terminal region 
of NaTrxh is an important mediator of the NaTrxh:S-RNase 
interaction. 

NaTrxh is transported through vesicles toward 
the apoplast 

The immune assays we performed clearly show that 
NaTrxh is mainly localized to membranous bodies, pri- 
marily vesicles, which correlate with the finding of 
NaTrxh in the microsomal fraction. These data strongly 
indicate that NaTrxh is carried to the extracellular space 
by means of a vesicle-dependent secretion pathway. The 
electron microscopy data clearly place NaTrxh inside 
vesicles (Figure 1), although some gold particles were 
observed to be associated with ER and other unidentified 
membranous systems, we cannot affirm that these vesi- 
cles come from the ER, the Golgi, or both. 



Although NaTrxh lacks a canonical signal peptide, its 
association with vesicles correlates well with its extracel- 
lular localization. A possible secretion mechanism for 
proteins of this kind relies on their direct interaction 
with secretion vesicles without previous association to 
the ER/Golgi [45]. In mammalian cells and yeast, some 
proteins are secreted through an ER/Golgi-independent 
pathway. Such is the case of insulin degrading enzymes 
[46], interleukins IL-lb and IL-18 [47], and some yeast 
proteins lacking a signal peptide [45]. 

NaTrxh has an internal and hydrophilic secretion signal 
and is secreted via the ER/Golgi 

The symplastic localization of the NaTrxhANa mutant 
(Figure 2B-3, B-4) shows that the first 17 N-terminal 
residues are not essential for NaTrxh secretion. Instead, 
the internal amino acid sequence A17EAESGSSSEP27 
(N|3), despite lacking the characteristic hydrophobicity 
(Additional file 2: Figure S2) shown on classical signal 
peptides [48], is essential and sufficient for NaTrxh se- 
cretion, as observed by the cytoplasmic localization of 
the NaTrxhAN|3 mutant (Figures 2B-5, B-6). This motif 
is also sufficient to direct the N[3 GFP-tagged to the 
extracellular space (Figures 2B-9, B-10). Most proteins 
secreted through the ER/Golgi pathway are translated 
in ribosomes attached to the ER membrane and pos- 
sess a signal peptide localized at the N-terminus [48]. 
One important property of such signal peptides is their 
hydrophobicity [48]. This feature is essential for recog- 
nition of the nascent peptide by the signal receptor 
particle (SRP) [49]. Although our data indicate that 
NaTrxh passes through the ER and Golgi en route to 
the apoplast — as shown with the KDEL constructs 
(Figure 4A), the blocking of NaTrxh secretion by BFA 
(Figure 4B), and the presence of NaTrxh in the micro- 
somal fraction (Figure 1A) — we do not know how 
NaTrxh is transported into the ER and how the Golgi 
participates in its secretion. Although several possible 
scenarios are feasible, we currently have no evidence to 
favor any of them. Two examples that support secre- 
tion of proteins without a conventional signal peptide 
and using the endomembrane system are the proteins 
IL-1[3 and AcbA (acyl-coenzyme A-binding protein) 
[50]. IL-1(3 joins secretory lysosomes and is released 
when those lysosomes fuse with the plasma membrane 
[51,52]. IL-1|3 also can be captured directly into multi- 
vesicular bodies or be sequestered by autophagosomes 
and fuse with multivesicular bodies [52,53]. 

Non-classical secretion of cytoplasmic plant proteins 
has also been documented, as reviewed in [54,55]. It has 
been demonstrated that proteins without signal peptide, 
such as celery mannitol dehydrogenase in A. thaliana, 
traffic to the apoplast while bypassing the classical ER/ 
Golgi secretion pathway [56]. Another example is the 
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hygromycin phosphotransferase in A. thaliana, which is 
secreted through a Golgi-independent route mediated by 
the Golgi-localized synaptotagmin 2 [57]. However, this 
is unlikely to be the case for NaTrxh since our data 
clearly showed that it goes through both ER and Golgi 
for its secretion (Figure 4). 

Another possible route that NaTrxh could follow to 
the apoplast is through specialized vesicles, such as the 
exosome-like nanovesicles described in Olea europea 
pollen tubes, called pollensomes [58]. Some of the pol- 
lensomes are proposed to be ER- and Golgi-derived vesi- 
cles based on the fact that Ole e 1 from O. europea was 
found to be within these pollensomes [58]. Regarding 
NaTrxh, we observed that some of it was contained in 
cytoplasmic vesicles and some of them were observed 
fused to the plasma membrane (Figure IE and F). In the 
apoplast, NaTrxh was never found associated to any 
exosome-like structure, as described for pollensomes 
[58] (Figure 1). 

An additional possibility is that NaTrxh could associ- 
ate to endomembrane systems through lipidic modifica- 
tions. Actually, Traverso et al. [59] found that NaTrxh is 
in vitro myristoylated at Gly-2, suggesting that NaTrxh 
may be a membrane-associated protein in planta. Based 
on this, it was speculated that it could be the manner 
about how NaTrxh is transported to the apoplast [59,11]. 
However, this scenario appears to be unlikely to occur 
because our deletion analysis outcomes indicated that 
the first 16 amino acids (the Na motif) are not es- 
sential for NaTrxh secretion, instead it was the inner 
domain, the Np motif, the one that directly led its 
secretion (Figure 2). 

The N(3 motif is apparently exclusive to plant Trxs. 
Besides NaTrxh, a similar motif has been found in only 
two soybean thioredoxins (Trxh2 and Trxhl) that, not- 
ably, are associated with the plasma membrane. Both 
soybean Trxs have an N-terminal extension [60] that 
includes a region with a high similarity index to the N(3 
sequence (Additional file 3: Figure S3). 

Our cell biology data along with our molecular assays 
by transient expression of different versions of NaTrxh 
fused to GFP indicate that NaTrxh secretion is due to its 
N(3 motif and that the protein follows a secretion path- 
way that requires the ER, the Golgi apparatus, and secre- 
tion vesicles. How NaTrxh interacts with these secretory 
elements is not known since the NaTrxh N-terminus 
does not have any of the typical signal peptide bio- 
chemical properties. However, the absence of an ortho- 
dox signal peptide in NaTrxh reveals the existence of 
an alternative secretion mechanism that uses, to some 
extent, the ER/Golgi secretory pathway. The accurate 
mechanism that leads NaTrxh secretion needs to be 
clarified and future research will be of great interest in 
order to unravel possible novel plant trafficking routes. 



NaTrxh:S-RNase interaction 

Protein and mRNA levels of NaTrxh are higher in the 
styles of SI plants than in self-compatible plants, and 
S-RNase interacts with NaTrxh in vitro. These facts 
have been used to classify NaTrxh as a pistil modifier gene 
that accounts for pollen rejection in N. alata [22,26]. 

This work contributes to our understanding of the 
molecular mechanism mediating the NaTrxh: S-RNase 
interaction. The pull-down experiments show that the 
NaTrxh C-terminal extension (E-136 to Q-152) is es- 
sential for its interaction with S-RNase (Figure 6D). 
However, this region does not affect NaTrxh secretion 
(Figure 2B-11, B-12) or Trx activity (Figure 5B). There- 
fore, it appears that NaTrxh is able to fold correctly in 
the absence of the C-terminal domain or at least fold 
well enough to sustain its native-like reductase activity. 

In N. alata, several proteins are directly involved in 
pollen rejection. In this species, S-RNase degrades the 
pollen tube RNA and determines sexual incompatibility 
on the female side. The NaTrxh:S-RNase interaction 
could be relevant to the SI response. NaTrxh likely stabi- 
lizes S-RNase or inhibits its ribonuclease activity in the 
pollen tube. Indeed, Oxley and Bacic [61] showed that 
S-RNase ribonuclease activity is affected by redox state 
in vitro. However, the redox state of NaTrxh does not 
impair its interaction with S-RNase [22]. 

S-RNase forms complexes with other stylar proteins 
(i.e., 120 K, pll, NaTTs) [62]. While 120 K is known to 
be essential for SI in N. alata [63], the precise function 
of these protein complexes in SI is still unclear. It is 
possible that NaTrxh participates as an associating 
factor to transport such as S-RNase, 120 K, NaTTs or 
pll to the pollen tube or, alternatively, to release these 
proteins from S-RNase complexes once inside the 
pollen tube. Both scenarios may be possible since a 
redox change by NaTrxh could play an important role 
for modifying S-RNase and stylar protein complexes. It 
has been determined that one of the targets of NaTrxh 
is actually S-RNase [22]. Therefore, further research is 
needed to determine if NaTrxh is a modifier factor in 
N. alata SI by altering the S-RNase redox state in 
planta. Although the data presented here are consistent 
with a role of NaTrxh in pollen rejection in SI Nicotiana 
species, loss of function assays would provide direct evi- 
dence of this role. 

Finally, homology modeling to predict the 3D struc- 
ture of Trx h revealed high sequence similarity between 
the H. vulgare and N. alata Trxh proteins, including 
conservation of the reactive site loop. The quality of the 
predicted model indicates similarity at the structural 
level too. The barley Trx h protein plays an important 
regulatory role during seed germination [64], and one of 
its targets is the barley ct-amylase/subtilisin inhibitor, a 
homologue of the SI modifier N. alata NaStEP protein 
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recently described by our group [65]. Both NaStEP and 
H. vulgare inhibitor proteins share extensive structural 
similarity; the loop equivalent to the one mediating the 
interaction of the barley proteins is present in NaStEP 
but is longer and has four cysteine residues instead of 
two [65] . An interaction of NaTrxh and NaStEP at some 
point along the physiological events regulating pollen rejec- 
tion in Nicotiana is a possibility worth future consideration. 

Conclusions 

Thioredoxins type h clustered within the subgroup 2 
contain non-conserved extensions towards the N- and/or 
the C-termini, which function is still unclear. In this work, 
we showed that the N-terminal extension of NaTrxh, a 
subgroup 2 Trx h from N. alata, contained the sufficient 
information to lead its secretion towards the apoplast. 
Interestingly, this extension contains two distinguishable 
motifs, called Not and N|3, divided by the hidden Markov 
algorithm [28] prediction of a cleavage site on the Ala- 16 
position. While the Na domain appeared unlikely to have 
any function on NaTrxh secretion, the N[3 was the respon- 
sible for its particular subcellular localization. 

The NP domain is the only sequence necessary for 
NaTrxh secretion. Transient expression experiments in 
epidermal onion cells of the N(3-GFP fusion protein 
revealed its localization in the apoplast, as occurred with 
the full NaTrxh protein fused to GFP and with the 
NaTrxhANoc and NaTrxhACOO mutants as well. These 
data corroborated the N[3 function as a novel signal pep- 
tide since its primary structure position and hydrophilic 
profile do not follow the typical biochemical features of 
the classical transport sequences. 

The Nf> biochemical features suggested an ER/Golgi 
independent secretion pathway. However, NaTrxh was 
detected in a microsomal fraction and, furthermore, was 
immune-detected mainly associated to classical secretory 
elements when observed within the cells in N. alata 
styles, providing evidences that the NaTrxh could be 
secreted through the classical ER/Golgi pathway or at 
least, it uses the elements of this route. This hypothesis 
was also tested by fusing the ER retention signal KDEL to 
NaTrxh-GFP and treating the cells, when expressing the 
fusion protein NaTrxh-GFP, with BFA, separately. In the 
first case, the NaTrxh-GFP(KDEL) was found associated 
to the ER and in the latter, the NaTrxh-GFP secretion 
was abolished, confirming that the NaTrxh actually goes 
through these organelles in order to be secreted. Fur- 
thermore, it was also found that the N(3 domain played 
an important structural role on the NaTrxh tertiary 
structure stability since the NaTrxhAN(3, which only 
lacks the NP domain, was detected in inclusion bodies 
when overexpressed in E. coli cells and not in the soluble 
one as the wild type and the other NaTrxh versions did 
(i.e., NaTrxhANa, NaTrxhANaP and NaTrxhACOO). 



Regarding the C-terminus, it was found to be essential 
for the NaTrxh-S-RNase in vitro interaction, since the 
S-RNase was unable to bind to a NaTrxhACOO-containing 
column. 

Finally, the in silico analysis showed that the NaTrxh N- 
and C-termini are solvent exposed, suggesting a protein- 
protein interaction role. While this function appears to be 
essential for the S-RNase interaction, it also provided 
evidences on the Np domain, which should interact ei- 
ther with a non-identified secretory element or interact 
in a different manner as the classical secreted proteins 
do with SRP. Interestingly, the N-terminal extension 
clearly showed two structural motifs, which coincide 
with the Na and NP domains tested in this work. 

Methods 

Plant materials 

Self-incompatible (SI) Nicotiana alata SiosS'ios has been 
described previously [22,66-68]. 

GST fusion proteins, overexpression and purification 
from E. coli 

The NaTrxhANa and the NaTrxhANaP cDNAs were 
generated with Bam-Hl and £co-RI flanking sites using 
5 -GCGCGGArCCATGGCAGAGGCAGAATCAG-3 ' and 
5 '-GCGG>irCC4rGTCGCGTGTGATTG-3 ' as sense 
primers, respectively, and 5 -GCGCGCGGGA4_rrC 
AATTTATTGGACATGAAA-3 ' as the antisense pri- 
mer for both mutants. For the NaTrxhACOO mutant, 
we used 5 -CGCGCGGATCCATGGGATCGTATCTT 
TCAA-3' as the forward primer and 5 -CCG GAATTC 
CCTGTGCTTGAGAATCTTTTTCTCGAG-3 ' as the 
reverse primer with Bam-Hl and £co-RI sites, respect- 
ively. The NaTrxhANP cDNA with Bam-HI and £co-RI 
sites was generated by two sequential PCRs. For the 
first PCR, the forward primer was 5 -TCGCGTGTGA 
TTGCTTTTCATTCTTCCAAT-3'. The PCR product 
from the first amplification was used as template for 
the second PCR, with 5 -GG^irCCATGGGATCGT 
ATCTTTCAAGTTTGCTCGGTGGAGGCGCGGCGG 
AAGCGTCGCGTGTGAT-3' used as the sense primer. 
The reverse primer in both amplification steps was the 
same as that used for the other N-mutants. 

For overexpression of the mutant forms of NaTrxh, each 
cDNA was cloned into pGEX 4 T-2 (pGEX) (Amersham 
Biosciences) in E. coli BL21(DE3)pLysS cells (Stratagene), 
as was previously described for the wild type NaTrxh [22] . 

The GSTNaTrxh, G STNaTrxh ANa, GSTNaTrxhANp, 
GSTrNaTrxhANap, and GST:NaTrxhACOO fusion 
proteins were overexpressed in E. coli cultures at an 
OD 600 of 0.5 - 0.7 by adding 0.1 mM IPTG and incubat- 
ing for 3 - 5 h at 37°C. The proteins were separated by 
batch affinity chromatography using glutathione agarose 
(Sigma). 
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Constructs for NaTrxh versions, NB and p1 1 fused to GFP 

NaTrxh and NaTrxhAN|3 cDNA with attBl and attB2 
sites were generated by using the following primers: 
forward, 5 ' -GGGGACAAGTTTGTACAAAAAAGCAGGC 
7TCATGGGATCGTATCTTTCAAGTTTG-3 '; reverse, 
5 ' -GGGGACCACTTTGTACAAGAAAGCTGGGTCTTGG 
ACATG AAATTTAGTTCG ATA-3 ' . The pGEX::NaTrxh 
and pGEX::NaTrxhAN|3 constructs were used as tem- 
plates, respectively. The PCR products were cloned 
into pDONR/Zeo (Invitrogen) by recombination with a 
BP Clonase (Invitrogen), following the manufacturer's 
instructions. 

The NaTrxhANa, NaTrxhANa|3, and pll cDNAs were 
generated by PCR using the forward primers 5 -CACC 
^TGGCAGAGGCAGAATCAGGA-3 ', 5 ' - CA CCA TGTC 
GCGTGTGATTGCTTTT-3 ' and 5 ' -CACCArGTCAGG 
AAAAC AAG GG TCTG C A AT T T TAT G - 3 ' , respectively. 
For the NaTrxhANa and NaTrxhANap clones, the 
reverse primer was 5 -TTATTGG ACATG AAATTTAG 
TTCGATAATTACTAGCAGC-3 '. For pll, the reverse 
primer was 5' TTTGTTTGTTAACTTAGCAGTAAC 
TGAAATCTTTTGGCC 3'. All cDNAs were cloned 
into pENTR/D-TOPO (Invitrogen), following the man- 
ufacturer's instructions. 

NaTrxhACOOcDNA was cloned into pENTR/D-TOPO 
by using 5 ' -CACCTGGGATCGTATCTTTCAAGT-3 ' and 
5 ' -TCATTCCCTGTGCTTGAGAATCTT-3 ' as sense and 
antisense primers, respectively. 

To generate the Nji sequence, the DNA fragment 5'-C 
ACCArGGCAGAGGCAGAATCAGGATCGTCGTCAG 
AACCG-3' was aligned with its complement by mixing 
both primers and incubating at 94°C for 15 min and 
then for 30 min at room temperature before proceeding 
with cloning into pENTR/D-TOPO (Invitrogen), follow- 
ing the manufacturer's instructions. 

All NaTrxh sequences (including Nfi) were transferred 
by recombination by LR recombinase enzymatic mixture 
(Invitrogen) to pEarleyGatewayl03 (C-GFP-HIS) (pEG103) 
[69], following the manufacturer's instructions. In the plas- 
mid, the gene of interest is translated with GFP fused to its 
C-terminus under the control of the 35S cauliflower mosaic 
virus (35S) promoter. 

The NaTrxh:GFP(KDEL), Nji: GFP( KDEL ), and 
pll-.GFP(KDEL) constructs were generated by PCR using 
the corresponding pEG103 construct as a template. 
The forward primers were the same as described above for 
each construct and the reverse was 5' TCAAAGCTC 
^rCITIGTGGTGGTGGTGGTGGTGGCTAGC 3', which 
encodes the amino acids KDEL fused to the C-terminus 
of GFP from pEG103. 

Transient expression assays in onion epidermal cells 

Each fusion construct 35S:NaTrxh-GFP was individually 
bombarded into onion epidermal cells [70]. After 24 h 



of particle bombardment, onion epidermal cells were 
plasmolyzed by incubation in 1 M NaCl for 10 min. 
Fluorescence was visualized using an Olympus FV 1000 
confocal microscope with 485/545 nm excitation/emis- 
sion light for GFP and 570/670 nm for propidium iodine 
(Sigma), which was used to stain the nucleus. 

Brefeldin A (BFA) treatment 

Bombarded onion epidermal cells were incubated in 
50 ug/ml Brefeldin A (BFA; Sigma) for 30 min at room 
temperature before observation under a confocal micro- 
scope [71]. 

Protein assay 

Protein concentrations were determined as described else- 
where [72] and using bovine serum albumin as standard. 

Reductase activity assay 

The ability of the soluble NaTrxh mutants to reduce 
insulin disulphide bonds was evaluated as previously 
described [7] and compared with the reductase activ- 
ity of the wild-type recombinant NaTrxh [22]. In 
these assays, 2.5 ug of purified Trx protein were used. 

Protein gel blot analysis and immunostaining 

Proteins were fractionated by 12.5 % SDS-PAGE, blotted 
onto nitrocellulose, and then immunostained with poly- 
clonal anti-GST (1:10,000 dilution), anti-NaTrxh (1:1,000) 
[22], or anti 5i 05 -RNase (1:10,000 dilution) [29]. 

Affi-gel affinity columns and pull-down assays 

Purified recombinant GST fusion proteins (30 mg) 
were immobilized on Affi-gel- 10 (Bio-Rad), following 
the manufacturer's instructions. The NaTrxh rec -Affi- 
gel affinity column used was the same as previously 
reported [22]. 

Protein crude extracts (1 mg) from N. alata S105S105 
were obtained in a binding buffer (BB: 50 mM Tris-HCl, 
pH 7.9) and passed over each GST-NaT rxhAffi-gel col- 
umn. After recovering the unbound fraction, ten bed 
volume washes were done with the BB. The column was 
sequentially washed as follows: (a) BB plus 1 % Tween-20, 
(b) BB plus 0.1 M NaCl, and (c) BB plus 0.2 M NaCl 
where indicated. Tightly bound proteins were eluted with 
a 50 mM glycine plus 50 mM NaCl, pH 2.6 buffer. The 
samples were neutralized by the immediate addition of 
1 M Tris. Fractions were concentrated by cold acetone 
precipitation and analyzed by SDS-PAGE and western 
blot, as described above. 

Multiple alignment analysis 

Amino acid sequences of plant Trxs were aligned by 
ClustalW [73]. The GenBank accession numbers used 
are as follows: from Trx h subgroup 1, A. thaliana 
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(S58118 and S58119), Brassica napus (Q39362), B. oleracea 
(P68176), N. tabacum (Q07090 and P29449), and Oryza 
sativa (D26547); and from Trx h subgroup 2, A. thaliana 
(AAD39316, AAG52561 and S58123), Ipomoea batatas 
(AY344228), and N. alata (DQ021448). 

Modeling of the N. alata h2 thioredoxin 

BLAST sequence analysis revealed h2 thioredoxin from 
barley (H. vulgare; PDB id 2IWT) as the closest 
homologue to the N. alata thioredoxin NaTrxh. With 
Modeller 9v4 [74,75], several possible alignments were 
then used to produce up to 100 initial homology 
models of NaTrxh. After evaluation using ANOLEA, 
the best models were recombined in Modeller, and the 
selection repeated. The best model was then used for a 
Molecular Dynamics (MD) simulation run at 35 ns in 
an octahedral water box (1.2 nm distance from the 
walls to the protein) with 0.15 M NaCl, at 303 K 
(Berendsen thermostat, with velocity rescaling) and 
standard pressure (Berendsenbarostat), using GRO- 
MACS v4.5 [33] and the GROMOS G53a6 force field 
[76]. After a 30 ns simulation, the radius of gyration 
and root mean squared deviation (rmsd) changes of 
the protein reached a plateau, and only the first 30 and 
the last 20 amino acids appeared partially unfolded. 
The most representative conformer was recovered with 
clustering from 30 to 35 ns, minimized, and the MD 
simulation repeated. Again, after clustering from 30 to 
35 ns and minimization, a final model was produced. 
The last 5 ns of MD simulation were used to estimate 
the local pre-residue fluctuations (rmsf) as a tool to 
identify poorly structured regions in the protein. For 
this particular case, the repeated MD simulations con- 
verged better to a stable model than the classic simu- 
lated annealing MD. The reason behind this was not 
pursued further. 

The final model was minimized again using the RO- 
SETTA relax-fast protocol [77] and its quality evaluated. 
The ANOLEA total energy [37] is -1184.276 E/kT units, 
with positive energies (sum 41.691 E/kT units) in 12 of 
52 residues (Figure 7). The ProQ scores [38,78,79] are 
LG 2.910 (very good) and MaxSub 0.301 (fairly good). 
MaxSub has been compared to other scores and was 
found to be a reliable quality score for homology-based 
models [79]. The biological appropriateness of the final 
model was rated using the Rd.HMM protocol [39]. The 
search of amino acid sequences compatible with the final 
model on the NCBI sequence databases retrieved 62 
sequences. The top score corresponded to the NaTrxh 
amino acid sequence (GenBank accession AAY42864.1, 
HMM score 124, E-value 6.6 x 10 31 ). The original X-ray 
template is at position 29 with an HMM score of 20. In 
the remaining sequences, 45 were Trxh sequences, 15 
lacked functional annotation, and none were annotated 



with an alternative function. The ratio of the HMM 
score to the NaTrxh sequence length is 0.84, which is in- 
dicative of a very good model [39]. The HMM alignment 
is consistent with an optimal threading of the amino 
acid sequence to the NaTrxh 3D model. From these 
data, the 3D coordinates of the NaTrxh predicted model 
could be rated as highly appropriate to host the NaTrxh 
amino acid sequence. In contrast to other quality as- 
sessment methods for protein structural models, the 
Rd.HMM protocol does not give false positives; there- 
fore, the model may be regarded as a highly reliable 
prediction. 

Disordered regions of NaTrxh 

We used DisEMBL [44,80] to predict, in silico, intrin- 
sically disordered regions (IDRs) in the NaTrxh amino 
acid sequence. DisEMBL reports three different index 
values for assignment of IDRs: loops/coils, hot loops, 
and remark 465. Loops/coils indicate residues found 
within loops that are not necessarily disordered; hot 
loops indicate highly dynamic loop regions that should 
be regarded as disordered; and remarl<465 labels the 
Protein Data Base (PDB) records with missing coor- 
dinates in the X-ray structure, accordingly. The Dis- 
EMBL index was trained with PDB data to predict 
highly mobile regions likely to produce a poorly de- 
fined electron density in the would-be X-ray data [80]. 

Immuno-gold and transmission electron microscopy 

N. alata flowers were emasculated 48 h before anthesis 
and then collected as they reached maturity. Tissue 
was cut 2 mm below the stigma and fixed in 3% para- 
formaldehyde and 0.5% glutaraldehyde in 0.1 M PBS 
for a minimum of 4 h at 4°C. Tissue was rinsed in 
0.1 M PBS three times for 10 min. The fixed tissue was 
dehydrated in a series of ethanol concentrations (30%, 
40%, 50%, 60%, 70%, 80%, 90%, 96%, 100%) and infil- 
trated with LR-white resin by dipping it into solutions 
of increasing concentration (25%, 50%, 75%, 100%). 
The selected regions were then cut into ultra thin 
sections and placed in nickel grids. For immunolo- 
calization, sections were blocked with TBST buffer (20 
mM Tris pH 7.6, 150 mM NaCl, 20 mM sodium azide, 
1% Tween 20, 5% BSA) for 1 h at room temperature 
and then incubated with a rabbit anti-NaTrxh antibody 
(1:10) at 4°C overnight. The sections were washed 
three times with TBST for 5 min and then incubated 
with 25 nm gold-labeled anti-rabbit IgG (1:10) for 2 h 
at room temperature. Grids were washed three times 
in TBST and then two times in deionized water. The 
sections were stained with uranyl acetate followed by 
lead citrate. Grids were observed and photographed 
with a JEOL 1200EXII electron microscope. 
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Additional files 



Additional file 1: Figure SI. The N- and C-terminal extensions in 
NaTrxh. Protein alignment of various plant Trxs h. NaTrxh N- and 
C- terminal extensions are bolded. The N-terminal extension was split 
into two subregions based on the Hidden-Markov-predicted cleavage 
site: Na covers from the Met-1 to the Ala-16 residues: the N(3expands 
from Ala-17 to Pro-27. 

Additional file 2: Figure S2. Hydrophobicity profiles of the N-termini 
of Napll and NaTrxh proteins. Dotted line: Napll; solid line: NaTrxh. 

Additional file 3: Figure S3. A similar N(3 motif from N. alata (Nala 
[DQ021448]) is found in Glycine max Trxhl and Trxh2 (Gmax [TRX1] 
and Gmax URX2], respectively), which are associated with the plasma 
membrane. The N(3 motif (underlined), essential to lead NaTrxh secretion, is 
conserved in Trxhl and Trxh2 (both associated to the plasma membrane) 
from soybean [60]. 
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